135 research outputs found

    Fast Algorithms for Constructing Maximum Entropy Summary Trees

    Full text link
    Karloff? and Shirley recently proposed summary trees as a new way to visualize large rooted trees (Eurovis 2013) and gave algorithms for generating a maximum-entropy k-node summary tree of an input n-node rooted tree. However, the algorithm generating optimal summary trees was only pseudo-polynomial (and worked only for integral weights); the authors left open existence of a olynomial-time algorithm. In addition, the authors provided an additive approximation algorithm and a greedy heuristic, both working on real weights. This paper shows how to construct maximum entropy k-node summary trees in time O(k^2 n + n log n) for real weights (indeed, as small as the time bound for the greedy heuristic given previously); how to speed up the approximation algorithm so that it runs in time O(n + (k^4/eps?) log(k/eps?)), and how to speed up the greedy algorithm so as to run in time O(kn + n log n). Altogether, these results make summary trees a much more practical tool than before.Comment: 17 pages, 4 figures. Extended version of paper appearing in ICALP 201

    Visualizing Multivariate Hierarchic Data Using Enhanced Radial Space-Filling Layout

    Get PDF
    Currently, visualization tools for large ontologies (e.g., pathway and gene ontologies) result in a very flat wide tree that is difficult to fit on a single display. This paper develops the concept of using an enhanced radial space-filling (ERSF) layout to show biological ontologies efficiently. The ERSF technique represents ontology terms as circular regions in 3D. Orbital connections in a third dimension correspond to non-tree edges in the ontology that exist when an ontology term belongs to multiple categories. Biologists can use the ERSF layout to identify highly activated pathway or gene ontology categories by mapping experimental statistics such as coefficient of variation and overrepresentation values onto the visualization. This paper illustrates the use of the ERSF layout to explore pathway and gene ontologies using a gene expression dataset from E. coli

    Visual Analysis of Multi-Joint Kinematic Data

    Get PDF
    Abstract Kinematics is the analysis of motions without regarding forces or inertial effect

    The generalized Robinson-Foulds metric

    Get PDF
    The Robinson-Foulds (RF) metric is arguably the most widely used measure of phylogenetic tree similarity, despite its well-known shortcomings: For example, moving a single taxon in a tree can result in a tree that has maximum distance to the original one; but the two trees are identical if we remove the single taxon. To this end, we propose a natural extension of the RF metric that does not simply count identical clades but instead, also takes similar clades into consideration. In contrast to previous approaches, our model requires the matching between clades to respect the structure of the two trees, a property that the classical RF metric exhibits, too. We show that computing this generalized RF metric is, unfortunately, NP-hard. We then present a simple Integer Linear Program for its computation, and evaluate it by an all-against-all comparison of 100 trees from a benchmark data set. We find that matchings that respect the tree structure differ significantly from those that do not, underlining the importance of this natural condition.Comment: Peer-reviewed and presented as part of the 13th Workshop on Algorithms in Bioinformatics (WABI2013

    Information Visualisation for Project Management: Case Study of Bath Formula Student Project

    Get PDF
    This paper contributes to a better understanding and design of dashboards for monitoring of engineering projects based on the projects’ digital footprint and user-centered design approach. The paper presents an explicit insight-based framework for the evaluation of dashboard visualisations and compares the performance of two groups of student engineering project managers against the framework: a group with the dashboard visualisations and a group without the dashboard. The results of our exploratory study demonstrate that student project managers who used the dashboard generated more useful information and exhibited more complex reasoning on the project progress, thus informing knowledge of the provision of information to engineers in support of their project understanding

    An end-user pipeline for scrapping and visualizing semi-structured data over the Web

    Get PDF
    The Web is a vast source of semi-structured data sets that are made readily available to support the construction of new knowledge. Information visualization techniques have been demonstrated a suitable alternative for allowing users to analyze and understand a large amount of data. However, the steps required for visualizing semi-structured data obtained from the Web is not straightforward, and it requires proper treatment before information visualization techniques could be applied. In this work, we present a visualization pipeline for describing the fundamental operations required for visualizing semi-structured data over the Web. For that, we employ Web Scrapping and Web Augmentation techniques for supporting interactive visualizations and solving tasks without changing the context of use of the data. Our approach is duly supported by a framework including scrapping, augmenting and visualization tools and it has been applied to different kinds of websites to demonstrate its validity and feasibility. Our ultimate goal is to expand the limits of our technology for improving the user interaction with websites and creating new experiences for better understanding large data sets

    A Visual Analytics Framework Case Study: Understanding Colombia’s National Administrative Department of Statistics Datasets

    Get PDF
    In a world filled with data, it is expected for a nation to take decisions informed by data. However, countries need to first collect and publish such data in a way meaningful for both citizens and policy makers. A good thematic classification could be instrumental in helping users to navigate and find the right resources on a rich data repository, such as the one collected by the DANE (Departamento Administrativo Nacional de Estadística, i.e. the Colombia’s National Administrative Department of Statistics). The Visual Analytics Framework is a methodology for conducting visual analysis developed by T. Munzner et al.1 that could help with this task. This paper presents a case study applying such framework conducted to help the DANE to better visualize their data repository, and also to understand it better by using another classification extracted from its metadata. It describes the three main analysis tasks identified and the proposed solutions. Usability testing results during the process helped to correct the visualizations and make them adapted to decision-making. Finally, we explained the collection of insights generated from them

    Generating Dashboards Using Fine-Grained Components: A Case Study for a PhD Programme

    Get PDF
    Developing dashboards is a complex domain, especially when several stakeholders are involved; while some users could demand certain indicators, other users could demand specific visualizations or design features. Creating individual dashboards for each potential need would consume several resources and time, being an unfeasible approach. Also, user requirements must be thoroughly analyzed to understand their goals regarding the data to be explored, and other characteristics that could affect their user experience. All these necessities ask for a paradigm to foster reusability not only at development level but also at knowledge level. Some methodologies, like the Software Product Line paradigm, leverage domain knowledge and apply it to create a series of assets that can be composed, parameterized, or combined to obtain fully functional systems. This work presents an application of the SPL paradigm to the domain of information dashboards, with the goal of reducing their development time and increasing their effectiveness and user experience. Different dashboard configurations have been suggested to test the proposed approach in the context of the Education in the Knowledge Society PhD programme of the University of Salamanca
    • …
    corecore